Dataset statistics
| Number of variables | 24 |
|---|---|
| Number of observations | 6743373 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.1 GiB |
| Average record size in memory | 175.0 B |
Variable types
| DateTime | 1 |
|---|---|
| Categorical | 13 |
| Text | 1 |
| Numeric | 9 |
dep_airport has a high cardinality: 350 distinct values | High cardinality |
dep_cityname has a high cardinality: 344 distinct values | High cardinality |
arr_airport has a high cardinality: 350 distinct values | High cardinality |
arr_cityname has a high cardinality: 344 distinct values | High cardinality |
distance_type is highly imbalanced (64.0%) | Imbalance |
delay_carrier is highly skewed (γ1 = 22.75351034) | Skewed |
delay_weather is highly skewed (γ1 = 45.85818526) | Skewed |
delay_nas is highly skewed (γ1 = 22.64626686) | Skewed |
delay_security is highly skewed (γ1 = 288.4636637) | Skewed |
dep_delay has 314586 (4.7%) zeros | Zeros |
arr_delay has 124753 (1.9%) zeros | Zeros |
delay_carrier has 5955483 (88.3%) zeros | Zeros |
delay_weather has 6671374 (98.9%) zeros | Zeros |
delay_nas has 6082588 (90.2%) zeros | Zeros |
delay_security has 6735225 (99.9%) zeros | Zeros |
delay_lastaircraft has 6032255 (89.5%) zeros | Zeros |
Reproduction
| Analysis started | 2024-06-23 17:30:48.083095 |
|---|---|
| Analysis finished | 2024-06-23 17:32:24.657387 |
| Duration | 1 minute and 36.57 seconds |
| Software version | ydata-profiling v4.8.3 |
| Download configuration | config.json |
flightdate
Date
| Distinct | 365 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 102.9 MiB |
| Minimum | 2023-01-01 00:00:00 |
|---|---|
| Maximum | 2023-12-31 00:00:00 |
day_of_week
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.9 MiB |
| Friday | |
|---|---|
| Thursday | |
| Monday | |
| Sunday | |
| Wednesday | |
| Other values (2) |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 7.116674 |
| Min length | 6 |
Characters and Unicode
| Total characters | 47990387 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Monday |
|---|---|
| 2nd row | Tuesday |
| 3rd row | Wednesday |
| 4th row | Thursday |
| 5th row | Friday |
Common Values
| Value | Count | Frequency (%) |
| Friday | 1003617 | |
| Thursday | 998353 | |
| Monday | 996628 | |
| Sunday | 984932 | |
| Wednesday | 951324 | |
| Tuesday | 937567 | |
| Saturday | 870952 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| friday | 1003617 | |
| thursday | 998353 | |
| monday | 996628 | |
| sunday | 984932 | |
| wednesday | 951324 | |
| tuesday | 937567 | |
| saturday | 870952 |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 7694697 | |
| a | 7614325 | |
| y | 6743373 | |
| u | 3791804 | |
| n | 2932884 | 6.1% |
| s | 2887244 | 6.0% |
| r | 2872922 | 6.0% |
| e | 2840215 | 5.9% |
| T | 1935920 | 4.0% |
| S | 1855884 | 3.9% |
| Other values (7) | 6821119 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 47990387 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| d | 7694697 | |
| a | 7614325 | |
| y | 6743373 | |
| u | 3791804 | |
| n | 2932884 | 6.1% |
| s | 2887244 | 6.0% |
| r | 2872922 | 6.0% |
| e | 2840215 | 5.9% |
| T | 1935920 | 4.0% |
| S | 1855884 | 3.9% |
| Other values (7) | 6821119 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 47990387 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| d | 7694697 | |
| a | 7614325 | |
| y | 6743373 | |
| u | 3791804 | |
| n | 2932884 | 6.1% |
| s | 2887244 | 6.0% |
| r | 2872922 | 6.0% |
| e | 2840215 | 5.9% |
| T | 1935920 | 4.0% |
| S | 1855884 | 3.9% |
| Other values (7) | 6821119 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 47990387 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| d | 7694697 | |
| a | 7614325 | |
| y | 6743373 | |
| u | 3791804 | |
| n | 2932884 | 6.1% |
| s | 2887244 | 6.0% |
| r | 2872922 | 6.0% |
| e | 2840215 | 5.9% |
| T | 1935920 | 4.0% |
| S | 1855884 | 3.9% |
| Other values (7) | 6821119 |
airline
Categorical
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.9 MiB |
| Southwest Airlines Co. | |
|---|---|
| Delta Air Lines Inc | |
| American Airlines Inc. | |
| United Air Lines Inc. | |
| Skywest Airlines Inc. | |
| Other values (10) |
Length
| Max length | 28 |
|---|---|
| Median length | 22 |
| Mean length | 19.99837 |
| Min length | 12 |
Characters and Unicode
| Total characters | 134856468 |
|---|---|
| Distinct characters | 37 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Endeavor Air |
|---|---|
| 2nd row | Endeavor Air |
| 3rd row | Endeavor Air |
| 4th row | Endeavor Air |
| 5th row | Endeavor Air |
Common Values
| Value | Count | Frequency (%) |
| Southwest Airlines Co. | 1421229 | |
| Delta Air Lines Inc | 972931 | |
| American Airlines Inc. | 928056 | |
| United Air Lines Inc. | 720031 | |
| Skywest Airlines Inc. | 664850 | |
| Republic Airways | 286487 | 4.2% |
| JetBlue Airways | 267915 | 4.0% |
| Spirit Air Lines | 258838 | 3.8% |
| Alaska Airlines Inc. | 242643 | 3.6% |
| American Eagle Airlines Inc. | 224692 | 3.3% |
| Other values (5) | 755701 |
Length
| Value | Count | Frequency (%) |
| inc | 4006504 | |
| airlines | 3925843 | |
| air | 2263128 | |
| lines | 1951800 | |
| southwest | 1421229 | 6.7% |
| co | 1421229 | 6.7% |
| american | 1152748 | 5.5% |
| delta | 972931 | 4.6% |
| united | 720031 | 3.4% |
| skywest | 664850 | 3.2% |
| Other values (11) | 2590678 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 15745526 | |
| 14347598 | 10.6% | |
| e | 12341228 | 9.2% |
| n | 12321555 | 9.1% |
| s | 8760767 | 6.5% |
| r | 8698780 | 6.5% |
| A | 8444261 | 6.3% |
| l | 6149361 | 4.6% |
| t | 6014907 | 4.5% |
| c | 5445739 | 4.0% |
| Other values (27) | 36586746 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 134856468 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 15745526 | |
| 14347598 | 10.6% | |
| e | 12341228 | 9.2% |
| n | 12321555 | 9.1% |
| s | 8760767 | 6.5% |
| r | 8698780 | 6.5% |
| A | 8444261 | 6.3% |
| l | 6149361 | 4.6% |
| t | 6014907 | 4.5% |
| c | 5445739 | 4.0% |
| Other values (27) | 36586746 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 134856468 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 15745526 | |
| 14347598 | 10.6% | |
| e | 12341228 | 9.2% |
| n | 12321555 | 9.1% |
| s | 8760767 | 6.5% |
| r | 8698780 | 6.5% |
| A | 8444261 | 6.3% |
| l | 6149361 | 4.6% |
| t | 6014907 | 4.5% |
| c | 5445739 | 4.0% |
| Other values (27) | 36586746 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 134856468 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 15745526 | |
| 14347598 | 10.6% | |
| e | 12341228 | 9.2% |
| n | 12321555 | 9.1% |
| s | 8760767 | 6.5% |
| r | 8698780 | 6.5% |
| A | 8444261 | 6.3% |
| l | 6149361 | 4.6% |
| t | 6014907 | 4.5% |
| c | 5445739 | 4.0% |
| Other values (27) | 36586746 |
tail_number
Text
| Distinct | 5963 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 456.6 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.997429 |
| Min length | 2 |
Characters and Unicode
| Total characters | 40442901 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | N605LR |
|---|---|
| 2nd row | N605LR |
| 3rd row | N331PQ |
| 4th row | N906XJ |
| 5th row | N337PQ |
| Value | Count | Frequency (%) |
| n488ha | 3327 | < 0.1% |
| n487ha | 3315 | < 0.1% |
| n486ha | 3306 | < 0.1% |
| n483ha | 3222 | < 0.1% |
| n484ha | 3221 | < 0.1% |
| n485ha | 3199 | < 0.1% |
| n479ha | 3190 | < 0.1% |
| n475ha | 3160 | < 0.1% |
| n495ha | 3150 | < 0.1% |
| n480ha | 3107 | < 0.1% |
| Other values (5953) | 6711176 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 9256833 | |
| 8 | 2779476 | 6.9% |
| 7 | 2486716 | 6.1% |
| 2 | 2477902 | 6.1% |
| 3 | 2475176 | 6.1% |
| 5 | 2146874 | 5.3% |
| 1 | 2140182 | 5.3% |
| 9 | 2098272 | 5.2% |
| 6 | 2056950 | 5.1% |
| 4 | 2047108 | 5.1% |
| Other values (25) | 10477412 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 40442901 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 9256833 | |
| 8 | 2779476 | 6.9% |
| 7 | 2486716 | 6.1% |
| 2 | 2477902 | 6.1% |
| 3 | 2475176 | 6.1% |
| 5 | 2146874 | 5.3% |
| 1 | 2140182 | 5.3% |
| 9 | 2098272 | 5.2% |
| 6 | 2056950 | 5.1% |
| 4 | 2047108 | 5.1% |
| Other values (25) | 10477412 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 40442901 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 9256833 | |
| 8 | 2779476 | 6.9% |
| 7 | 2486716 | 6.1% |
| 2 | 2477902 | 6.1% |
| 3 | 2475176 | 6.1% |
| 5 | 2146874 | 5.3% |
| 1 | 2140182 | 5.3% |
| 9 | 2098272 | 5.2% |
| 6 | 2056950 | 5.1% |
| 4 | 2047108 | 5.1% |
| Other values (25) | 10477412 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 40442901 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 9256833 | |
| 8 | 2779476 | 6.9% |
| 7 | 2486716 | 6.1% |
| 2 | 2477902 | 6.1% |
| 3 | 2475176 | 6.1% |
| 5 | 2146874 | 5.3% |
| 1 | 2140182 | 5.3% |
| 9 | 2098272 | 5.2% |
| 6 | 2056950 | 5.1% |
| 4 | 2047108 | 5.1% |
| Other values (25) | 10477412 |
dep_airport
Categorical
HIGH CARDINALITY 
| Distinct | 350 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 64.3 MiB |
| ATL | 332934 |
|---|---|
| DEN | 284200 |
| DFW | 280021 |
| ORD | 255071 |
| CLT | 192870 |
| Other values (345) |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 20230119 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BDL |
|---|---|
| 2nd row | BDL |
| 3rd row | BDL |
| 4th row | BDL |
| 5th row | BDL |
Common Values
| Value | Count | Frequency (%) |
| ATL | 332934 | 4.9% |
| DEN | 284200 | 4.2% |
| DFW | 280021 | 4.2% |
| ORD | 255071 | 3.8% |
| CLT | 192870 | 2.9% |
| LAX | 192259 | 2.9% |
| LAS | 188206 | 2.8% |
| PHX | 175144 | 2.6% |
| SEA | 162441 | 2.4% |
| MCO | 161846 | 2.4% |
| Other values (340) | 4518381 |
Length
| Value | Count | Frequency (%) |
| atl | 332934 | 4.9% |
| den | 284200 | 4.2% |
| dfw | 280021 | 4.2% |
| ord | 255071 | 3.8% |
| clt | 192870 | 2.9% |
| lax | 192259 | 2.9% |
| las | 188206 | 2.8% |
| phx | 175144 | 2.6% |
| sea | 162441 | 2.4% |
| mco | 161846 | 2.4% |
| Other values (340) | 4518381 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 2310554 | 11.4% |
| L | 1868908 | 9.2% |
| S | 1731880 | 8.6% |
| D | 1586087 | 7.8% |
| T | 1071692 | 5.3% |
| O | 1033702 | 5.1% |
| C | 1021107 | 5.0% |
| M | 905264 | 4.5% |
| F | 835546 | 4.1% |
| W | 789661 | 3.9% |
| Other values (16) | 7075718 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 20230119 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 2310554 | 11.4% |
| L | 1868908 | 9.2% |
| S | 1731880 | 8.6% |
| D | 1586087 | 7.8% |
| T | 1071692 | 5.3% |
| O | 1033702 | 5.1% |
| C | 1021107 | 5.0% |
| M | 905264 | 4.5% |
| F | 835546 | 4.1% |
| W | 789661 | 3.9% |
| Other values (16) | 7075718 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 20230119 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 2310554 | 11.4% |
| L | 1868908 | 9.2% |
| S | 1731880 | 8.6% |
| D | 1586087 | 7.8% |
| T | 1071692 | 5.3% |
| O | 1033702 | 5.1% |
| C | 1021107 | 5.0% |
| M | 905264 | 4.5% |
| F | 835546 | 4.1% |
| W | 789661 | 3.9% |
| Other values (16) | 7075718 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 20230119 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 2310554 | 11.4% |
| L | 1868908 | 9.2% |
| S | 1731880 | 8.6% |
| D | 1586087 | 7.8% |
| T | 1071692 | 5.3% |
| O | 1033702 | 5.1% |
| C | 1021107 | 5.0% |
| M | 905264 | 4.5% |
| F | 835546 | 4.1% |
| W | 789661 | 3.9% |
| Other values (16) | 7075718 |
dep_cityname
Categorical
HIGH CARDINALITY 
| Distinct | 344 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 64.3 MiB |
| Chicago, IL | 338766 |
|---|---|
| Atlanta, GA | 332934 |
| New York, NY | 288421 |
| Denver, CO | 284200 |
| Dallas/Fort Worth, TX | 280021 |
| Other values (339) |
Length
| Max length | 34 |
|---|---|
| Median length | 29 |
| Mean length | 13.045103 |
| Min length | 8 |
Characters and Unicode
| Total characters | 87967995 |
|---|---|
| Distinct characters | 57 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Hartford, CT |
|---|---|
| 2nd row | Hartford, CT |
| 3rd row | Hartford, CT |
| 4th row | Hartford, CT |
| 5th row | Hartford, CT |
Common Values
| Value | Count | Frequency (%) |
| Chicago, IL | 338766 | 5.0% |
| Atlanta, GA | 332934 | 4.9% |
| New York, NY | 288421 | 4.3% |
| Denver, CO | 284200 | 4.2% |
| Dallas/Fort Worth, TX | 280021 | 4.2% |
| Charlotte, NC | 192870 | 2.9% |
| Los Angeles, CA | 192259 | 2.9% |
| Las Vegas, NV | 188206 | 2.8% |
| Washington, DC | 186676 | 2.8% |
| Phoenix, AZ | 180547 | 2.7% |
| Other values (334) | 4278473 |
Length
| Value | Count | Frequency (%) |
| ca | 729623 | 4.6% |
| tx | 707547 | 4.5% |
| fl | 600100 | 3.8% |
| ny | 367049 | 2.3% |
| ga | 357006 | 2.3% |
| san | 352215 | 2.2% |
| il | 351390 | 2.2% |
| chicago | 338766 | 2.2% |
| new | 337488 | 2.1% |
| atlanta | 332934 | 2.1% |
| Other values (418) | 11242910 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8973655 | 10.2% | |
| , | 6743373 | 7.7% |
| a | 6726622 | 7.6% |
| o | 4843241 | 5.5% |
| e | 4645969 | 5.3% |
| n | 4315840 | 4.9% |
| t | 4204796 | 4.8% |
| l | 3879735 | 4.4% |
| i | 3337081 | 3.8% |
| r | 3178153 | 3.6% |
| Other values (47) | 37119530 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 87967995 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 8973655 | 10.2% | |
| , | 6743373 | 7.7% |
| a | 6726622 | 7.6% |
| o | 4843241 | 5.5% |
| e | 4645969 | 5.3% |
| n | 4315840 | 4.9% |
| t | 4204796 | 4.8% |
| l | 3879735 | 4.4% |
| i | 3337081 | 3.8% |
| r | 3178153 | 3.6% |
| Other values (47) | 37119530 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 87967995 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 8973655 | 10.2% | |
| , | 6743373 | 7.7% |
| a | 6726622 | 7.6% |
| o | 4843241 | 5.5% |
| e | 4645969 | 5.3% |
| n | 4315840 | 4.9% |
| t | 4204796 | 4.8% |
| l | 3879735 | 4.4% |
| i | 3337081 | 3.8% |
| r | 3178153 | 3.6% |
| Other values (47) | 37119530 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 87967995 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 8973655 | 10.2% | |
| , | 6743373 | 7.7% |
| a | 6726622 | 7.6% |
| o | 4843241 | 5.5% |
| e | 4645969 | 5.3% |
| n | 4315840 | 4.9% |
| t | 4204796 | 4.8% |
| l | 3879735 | 4.4% |
| i | 3337081 | 3.8% |
| r | 3178153 | 3.6% |
| Other values (47) | 37119530 |
deptime_label
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.9 MiB |
| Morning | |
|---|---|
| Afternoon | |
| Evening | |
| Night | 211159 |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.6383132 |
| Min length | 5 |
Characters and Unicode
| Total characters | 51507995 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Morning |
|---|---|
| 2nd row | Morning |
| 3rd row | Morning |
| 4th row | Morning |
| 5th row | Morning |
Common Values
| Value | Count | Frequency (%) |
| Morning | 2611546 | |
| Afternoon | 2363351 | |
| Evening | 1557317 | |
| Night | 211159 | 3.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| morning | 2611546 | |
| afternoon | 2363351 | |
| evening | 1557317 | |
| night | 211159 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 13064428 | |
| o | 7338248 | |
| r | 4974897 | 9.7% |
| i | 4380022 | 8.5% |
| g | 4380022 | 8.5% |
| e | 3920668 | 7.6% |
| M | 2611546 | 5.1% |
| t | 2574510 | 5.0% |
| A | 2363351 | 4.6% |
| f | 2363351 | 4.6% |
| Other values (4) | 3536952 | 6.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 51507995 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| n | 13064428 | |
| o | 7338248 | |
| r | 4974897 | 9.7% |
| i | 4380022 | 8.5% |
| g | 4380022 | 8.5% |
| e | 3920668 | 7.6% |
| M | 2611546 | 5.1% |
| t | 2574510 | 5.0% |
| A | 2363351 | 4.6% |
| f | 2363351 | 4.6% |
| Other values (4) | 3536952 | 6.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 51507995 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| n | 13064428 | |
| o | 7338248 | |
| r | 4974897 | 9.7% |
| i | 4380022 | 8.5% |
| g | 4380022 | 8.5% |
| e | 3920668 | 7.6% |
| M | 2611546 | 5.1% |
| t | 2574510 | 5.0% |
| A | 2363351 | 4.6% |
| f | 2363351 | 4.6% |
| Other values (4) | 3536952 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 51507995 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| n | 13064428 | |
| o | 7338248 | |
| r | 4974897 | 9.7% |
| i | 4380022 | 8.5% |
| g | 4380022 | 8.5% |
| e | 3920668 | 7.6% |
| M | 2611546 | 5.1% |
| t | 2574510 | 5.0% |
| A | 2363351 | 4.6% |
| f | 2363351 | 4.6% |
| Other values (4) | 3536952 | 6.9% |
dep_delay
Real number (ℝ)
ZEROS 
| Distinct | 1854 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.201062 |
| Minimum | -99 |
|---|---|
| Maximum | 4413 |
| Zeros | 314586 |
| Zeros (%) | 4.7% |
| Negative | 3873030 |
| Negative (%) | 57.4% |
| Memory size | 102.9 MiB |
Quantile statistics
| Minimum | -99 |
|---|---|
| 5-th percentile | -10 |
| Q1 | -5 |
| median | -2 |
| Q3 | 9 |
| 95-th percentile | 78 |
| Maximum | 4413 |
| Range | 4512 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 55.079476 |
|---|---|
| Coefficient of variation (CV) | 4.5143181 |
| Kurtosis | 268.21142 |
| Mean | 12.201062 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 12.011293 |
| Sum | 82276314 |
| Variance | 3033.7486 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -5 | 504334 | 7.5% |
| -4 | 470786 | 7.0% |
| -3 | 455314 | 6.8% |
| -2 | 413301 | 6.1% |
| -6 | 402405 | 6.0% |
| -1 | 369789 | 5.5% |
| -7 | 340579 | 5.1% |
| 0 | 314586 | 4.7% |
| -8 | 272730 | 4.0% |
| -9 | 202664 | 3.0% |
| Other values (1844) | 2996885 |
| Value | Count | Frequency (%) |
| -99 | 1 | < 0.1% |
| -72 | 1 | < 0.1% |
| -68 | 1 | < 0.1% |
| -59 | 3 | |
| -55 | 1 | < 0.1% |
| -53 | 1 | < 0.1% |
| -52 | 2 | |
| -51 | 1 | < 0.1% |
| -50 | 1 | < 0.1% |
| -49 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 4413 | 1 | |
| 3786 | 1 | |
| 3695 | 1 | |
| 3518 | 1 | |
| 3445 | 1 | |
| 3343 | 1 | |
| 3249 | 1 | |
| 3238 | 1 | |
| 3221 | 1 | |
| 3024 | 1 |
dep_delay_tag
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 424.4 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 6743373 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 4187616 | |
| 1 | 2555757 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 4187616 | |
| 1 | 2555757 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4187616 | |
| 1 | 2555757 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6743373 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 4187616 | |
| 1 | 2555757 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6743373 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 4187616 | |
| 1 | 2555757 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6743373 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 4187616 | |
| 1 | 2555757 |
dep_delay_type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.9 MiB |
| Low <5min | |
|---|---|
| Medium >15min | |
| Hight >60min | 455669 |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 9.723525 |
| Min length | 9 |
Characters and Unicode
| Total characters | 65569356 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Low <5min |
|---|---|
| 2nd row | Low <5min |
| 3rd row | Low <5min |
| 4th row | Low <5min |
| 5th row | Low <5min |
Common Values
| Value | Count | Frequency (%) |
| Low <5min | 5409706 | |
| Medium >15min | 877998 | 13.0% |
| Hight >60min | 455669 | 6.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| low | 5409706 | |
| 5min | 5409706 | |
| medium | 877998 | 6.5% |
| 15min | 877998 | 6.5% |
| hight | 455669 | 3.4% |
| 60min | 455669 | 3.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 8077040 | |
| m | 7621371 | |
| 6743373 | ||
| n | 6743373 | |
| 5 | 6287704 | |
| L | 5409706 | |
| w | 5409706 | |
| < | 5409706 | |
| o | 5409706 | |
| > | 1333667 | 2.0% |
| Other values (11) | 7124004 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 65569356 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 8077040 | |
| m | 7621371 | |
| 6743373 | ||
| n | 6743373 | |
| 5 | 6287704 | |
| L | 5409706 | |
| w | 5409706 | |
| < | 5409706 | |
| o | 5409706 | |
| > | 1333667 | 2.0% |
| Other values (11) | 7124004 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 65569356 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 8077040 | |
| m | 7621371 | |
| 6743373 | ||
| n | 6743373 | |
| 5 | 6287704 | |
| L | 5409706 | |
| w | 5409706 | |
| < | 5409706 | |
| o | 5409706 | |
| > | 1333667 | 2.0% |
| Other values (11) | 7124004 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 65569356 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 8077040 | |
| m | 7621371 | |
| 6743373 | ||
| n | 6743373 | |
| 5 | 6287704 | |
| L | 5409706 | |
| w | 5409706 | |
| < | 5409706 | |
| o | 5409706 | |
| > | 1333667 | 2.0% |
| Other values (11) | 7124004 |
arr_airport
Categorical
HIGH CARDINALITY 
| Distinct | 350 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 64.3 MiB |
| ATL | 332939 |
|---|---|
| DEN | 283563 |
| DFW | 279729 |
| ORD | 254775 |
| CLT | 192910 |
| Other values (345) |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 20230119 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | LGA |
|---|---|
| 2nd row | LGA |
| 3rd row | LGA |
| 4th row | LGA |
| 5th row | LGA |
Common Values
| Value | Count | Frequency (%) |
| ATL | 332939 | 4.9% |
| DEN | 283563 | 4.2% |
| DFW | 279729 | 4.1% |
| ORD | 254775 | 3.8% |
| CLT | 192910 | 2.9% |
| LAX | 192415 | 2.9% |
| LAS | 188243 | 2.8% |
| PHX | 175196 | 2.6% |
| SEA | 162323 | 2.4% |
| MCO | 161373 | 2.4% |
| Other values (340) | 4519907 |
Length
| Value | Count | Frequency (%) |
| atl | 332939 | 4.9% |
| den | 283563 | 4.2% |
| dfw | 279729 | 4.1% |
| ord | 254775 | 3.8% |
| clt | 192910 | 2.9% |
| lax | 192415 | 2.9% |
| las | 188243 | 2.8% |
| phx | 175196 | 2.6% |
| sea | 162323 | 2.4% |
| mco | 161373 | 2.4% |
| Other values (340) | 4519907 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 2310578 | 11.4% |
| L | 1868986 | 9.2% |
| S | 1733219 | 8.6% |
| D | 1584866 | 7.8% |
| T | 1072284 | 5.3% |
| O | 1032982 | 5.1% |
| C | 1021071 | 5.0% |
| M | 904964 | 4.5% |
| F | 835169 | 4.1% |
| W | 789105 | 3.9% |
| Other values (16) | 7076895 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 20230119 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 2310578 | 11.4% |
| L | 1868986 | 9.2% |
| S | 1733219 | 8.6% |
| D | 1584866 | 7.8% |
| T | 1072284 | 5.3% |
| O | 1032982 | 5.1% |
| C | 1021071 | 5.0% |
| M | 904964 | 4.5% |
| F | 835169 | 4.1% |
| W | 789105 | 3.9% |
| Other values (16) | 7076895 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 20230119 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 2310578 | 11.4% |
| L | 1868986 | 9.2% |
| S | 1733219 | 8.6% |
| D | 1584866 | 7.8% |
| T | 1072284 | 5.3% |
| O | 1032982 | 5.1% |
| C | 1021071 | 5.0% |
| M | 904964 | 4.5% |
| F | 835169 | 4.1% |
| W | 789105 | 3.9% |
| Other values (16) | 7076895 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 20230119 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 2310578 | 11.4% |
| L | 1868986 | 9.2% |
| S | 1733219 | 8.6% |
| D | 1584866 | 7.8% |
| T | 1072284 | 5.3% |
| O | 1032982 | 5.1% |
| C | 1021071 | 5.0% |
| M | 904964 | 4.5% |
| F | 835169 | 4.1% |
| W | 789105 | 3.9% |
| Other values (16) | 7076895 |
arr_cityname
Categorical
HIGH CARDINALITY 
| Distinct | 344 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 64.3 MiB |
| Chicago, IL | 338319 |
|---|---|
| Atlanta, GA | 332939 |
| New York, NY | 288152 |
| Denver, CO | 283563 |
| Dallas/Fort Worth, TX | 279729 |
| Other values (339) |
Length
| Max length | 34 |
|---|---|
| Median length | 29 |
| Mean length | 13.045968 |
| Min length | 8 |
Characters and Unicode
| Total characters | 87973825 |
|---|---|
| Distinct characters | 57 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | New York, NY |
|---|---|
| 2nd row | New York, NY |
| 3rd row | New York, NY |
| 4th row | New York, NY |
| 5th row | New York, NY |
Common Values
| Value | Count | Frequency (%) |
| Chicago, IL | 338319 | 5.0% |
| Atlanta, GA | 332939 | 4.9% |
| New York, NY | 288152 | 4.3% |
| Denver, CO | 283563 | 4.2% |
| Dallas/Fort Worth, TX | 279729 | 4.1% |
| Charlotte, NC | 192910 | 2.9% |
| Los Angeles, CA | 192415 | 2.9% |
| Las Vegas, NV | 188243 | 2.8% |
| Washington, DC | 186597 | 2.8% |
| Phoenix, AZ | 180608 | 2.7% |
| Other values (334) | 4279898 |
Length
| Value | Count | Frequency (%) |
| ca | 730226 | 4.6% |
| tx | 707050 | 4.5% |
| fl | 599542 | 3.8% |
| ny | 366899 | 2.3% |
| ga | 357012 | 2.3% |
| san | 352643 | 2.2% |
| il | 350948 | 2.2% |
| chicago | 338319 | 2.2% |
| new | 337294 | 2.1% |
| atlanta | 332939 | 2.1% |
| Other values (418) | 11245153 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8974652 | 10.2% | |
| , | 6743373 | 7.7% |
| a | 6727590 | 7.6% |
| o | 4842414 | 5.5% |
| e | 4646558 | 5.3% |
| n | 4316941 | 4.9% |
| t | 4205094 | 4.8% |
| l | 3880056 | 4.4% |
| i | 3338091 | 3.8% |
| r | 3176704 | 3.6% |
| Other values (47) | 37122352 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 87973825 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 8974652 | 10.2% | |
| , | 6743373 | 7.7% |
| a | 6727590 | 7.6% |
| o | 4842414 | 5.5% |
| e | 4646558 | 5.3% |
| n | 4316941 | 4.9% |
| t | 4205094 | 4.8% |
| l | 3880056 | 4.4% |
| i | 3338091 | 3.8% |
| r | 3176704 | 3.6% |
| Other values (47) | 37122352 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 87973825 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 8974652 | 10.2% | |
| , | 6743373 | 7.7% |
| a | 6727590 | 7.6% |
| o | 4842414 | 5.5% |
| e | 4646558 | 5.3% |
| n | 4316941 | 4.9% |
| t | 4205094 | 4.8% |
| l | 3880056 | 4.4% |
| i | 3338091 | 3.8% |
| r | 3176704 | 3.6% |
| Other values (47) | 37122352 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 87973825 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 8974652 | 10.2% | |
| , | 6743373 | 7.7% |
| a | 6727590 | 7.6% |
| o | 4842414 | 5.5% |
| e | 4646558 | 5.3% |
| n | 4316941 | 4.9% |
| t | 4205094 | 4.8% |
| l | 3880056 | 4.4% |
| i | 3338091 | 3.8% |
| r | 3176704 | 3.6% |
| Other values (47) | 37122352 |
arr_delay
Real number (ℝ)
ZEROS 
| Distinct | 1880 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.6272352 |
| Minimum | -119 |
|---|---|
| Maximum | 4405 |
| Zeros | 124753 |
| Zeros (%) | 1.9% |
| Negative | 4146092 |
| Negative (%) | 61.5% |
| Memory size | 102.9 MiB |
Quantile statistics
| Minimum | -119 |
|---|---|
| 5-th percentile | -27 |
| Q1 | -15 |
| median | -6 |
| Q3 | 9 |
| 95-th percentile | 78 |
| Maximum | 4405 |
| Range | 4524 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 57.079037 |
|---|---|
| Coefficient of variation (CV) | 8.6127978 |
| Kurtosis | 235.0407 |
| Mean | 6.6272352 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 10.934328 |
| Sum | 44689919 |
| Variance | 3258.0165 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -11 | 189214 | 2.8% |
| -10 | 189118 | 2.8% |
| -12 | 188072 | 2.8% |
| -9 | 185850 | 2.8% |
| -13 | 184322 | 2.7% |
| -8 | 182424 | 2.7% |
| -14 | 179312 | 2.7% |
| -7 | 176691 | 2.6% |
| -15 | 172027 | 2.6% |
| -6 | 170459 | 2.5% |
| Other values (1870) | 4925884 |
| Value | Count | Frequency (%) |
| -119 | 1 | < 0.1% |
| -98 | 1 | < 0.1% |
| -97 | 1 | < 0.1% |
| -96 | 1 | < 0.1% |
| -94 | 1 | < 0.1% |
| -92 | 2 | < 0.1% |
| -91 | 1 | < 0.1% |
| -89 | 1 | < 0.1% |
| -88 | 1 | < 0.1% |
| -86 | 5 |
| Value | Count | Frequency (%) |
| 4405 | 1 | |
| 3795 | 1 | |
| 3680 | 1 | |
| 3502 | 1 | |
| 3424 | 1 | |
| 3337 | 1 | |
| 3246 | 1 | |
| 3241 | 1 | |
| 3237 | 1 | |
| 3063 | 1 |
arr_delay_type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.9 MiB |
| Low <5min | |
|---|---|
| Medium >15min | |
| Hight >60min | 453687 |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 9.7273839 |
| Min length | 9 |
Characters and Unicode
| Total characters | 65595378 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Low <5min |
|---|---|
| 2nd row | Low <5min |
| 3rd row | Low <5min |
| 4th row | Low <5min |
| 5th row | Low <5min |
Common Values
| Value | Count | Frequency (%) |
| Low <5min | 5403696 | |
| Medium >15min | 885990 | 13.1% |
| Hight >60min | 453687 | 6.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| low | 5403696 | |
| 5min | 5403696 | |
| medium | 885990 | 6.6% |
| 15min | 885990 | 6.6% |
| hight | 453687 | 3.4% |
| 60min | 453687 | 3.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 8083050 | |
| m | 7629363 | |
| 6743373 | ||
| n | 6743373 | |
| 5 | 6289686 | |
| L | 5403696 | |
| w | 5403696 | |
| < | 5403696 | |
| o | 5403696 | |
| > | 1339677 | 2.0% |
| Other values (11) | 7152072 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 65595378 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 8083050 | |
| m | 7629363 | |
| 6743373 | ||
| n | 6743373 | |
| 5 | 6289686 | |
| L | 5403696 | |
| w | 5403696 | |
| < | 5403696 | |
| o | 5403696 | |
| > | 1339677 | 2.0% |
| Other values (11) | 7152072 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 65595378 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 8083050 | |
| m | 7629363 | |
| 6743373 | ||
| n | 6743373 | |
| 5 | 6289686 | |
| L | 5403696 | |
| w | 5403696 | |
| < | 5403696 | |
| o | 5403696 | |
| > | 1339677 | 2.0% |
| Other values (11) | 7152072 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 65595378 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 8083050 | |
| m | 7629363 | |
| 6743373 | ||
| n | 6743373 | |
| 5 | 6289686 | |
| L | 5403696 | |
| w | 5403696 | |
| < | 5403696 | |
| o | 5403696 | |
| > | 1339677 | 2.0% |
| Other values (11) | 7152072 |
flight_duration
Real number (ℝ)
| Distinct | 724 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 140.2981 |
| Minimum | 0 |
|---|---|
| Maximum | 795 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 102.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 58 |
| Q1 | 87 |
| median | 124 |
| Q3 | 171 |
| 95-th percentile | 298 |
| Maximum | 795 |
| Range | 795 |
| Interquartile range (IQR) | 84 |
Descriptive statistics
| Standard deviation | 72.872159 |
|---|---|
| Coefficient of variation (CV) | 0.51940947 |
| Kurtosis | 2.4147338 |
| Mean | 140.2981 |
| Median Absolute Deviation (MAD) | 41 |
| Skewness | 1.3861544 |
| Sum | 9.4608239 × 108 |
| Variance | 5310.3516 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 81 | 53005 | 0.8% |
| 82 | 52585 | 0.8% |
| 79 | 52293 | 0.8% |
| 80 | 52180 | 0.8% |
| 83 | 51943 | 0.8% |
| 78 | 51745 | 0.8% |
| 84 | 51543 | 0.8% |
| 77 | 51196 | 0.8% |
| 85 | 51030 | 0.8% |
| 76 | 50814 | 0.8% |
| Other values (714) | 6225039 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 15 | 3 | < 0.1% |
| 16 | 9 | < 0.1% |
| 17 | 28 | < 0.1% |
| 18 | 36 | |
| 19 | 45 | |
| 20 | 49 | |
| 21 | 65 | |
| 22 | 81 | |
| 23 | 73 |
| Value | Count | Frequency (%) |
| 795 | 1 | |
| 759 | 1 | |
| 749 | 1 | |
| 744 | 1 | |
| 742 | 2 | |
| 736 | 1 | |
| 735 | 2 | |
| 734 | 1 | |
| 732 | 1 | |
| 731 | 1 |
distance_type
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.9 MiB |
| Short Haul >1500Mi | |
|---|---|
| Medium Haul <3000Mi | |
| Long Haul <6000Mi | 14061 |
Length
| Max length | 19 |
|---|---|
| Median length | 18 |
| Mean length | 18.12503 |
| Min length | 17 |
Characters and Unicode
| Total characters | 122223837 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Short Haul >1500Mi |
|---|---|
| 2nd row | Short Haul >1500Mi |
| 3rd row | Short Haul >1500Mi |
| 4th row | Short Haul >1500Mi |
| 5th row | Short Haul >1500Mi |
Common Values
| Value | Count | Frequency (%) |
| Short Haul >1500Mi | 5872128 | |
| Medium Haul <3000Mi | 857184 | 12.7% |
| Long Haul <6000Mi | 14061 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| haul | 6743373 | |
| short | 5872128 | |
| 1500mi | 5872128 | |
| medium | 857184 | 4.2% |
| 3000mi | 857184 | 4.2% |
| long | 14061 | 0.1% |
| 6000mi | 14061 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 14357991 | 11.7% |
| 13486746 | 11.0% | |
| i | 7600557 | 6.2% |
| M | 7600557 | 6.2% |
| u | 7600557 | 6.2% |
| H | 6743373 | 5.5% |
| a | 6743373 | 5.5% |
| l | 6743373 | 5.5% |
| o | 5886189 | 4.8% |
| S | 5872128 | 4.8% |
| Other values (15) | 39588993 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 122223837 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 14357991 | 11.7% |
| 13486746 | 11.0% | |
| i | 7600557 | 6.2% |
| M | 7600557 | 6.2% |
| u | 7600557 | 6.2% |
| H | 6743373 | 5.5% |
| a | 6743373 | 5.5% |
| l | 6743373 | 5.5% |
| o | 5886189 | 4.8% |
| S | 5872128 | 4.8% |
| Other values (15) | 39588993 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 122223837 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 14357991 | 11.7% |
| 13486746 | 11.0% | |
| i | 7600557 | 6.2% |
| M | 7600557 | 6.2% |
| u | 7600557 | 6.2% |
| H | 6743373 | 5.5% |
| a | 6743373 | 5.5% |
| l | 6743373 | 5.5% |
| o | 5886189 | 4.8% |
| S | 5872128 | 4.8% |
| Other values (15) | 39588993 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 122223837 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 14357991 | 11.7% |
| 13486746 | 11.0% | |
| i | 7600557 | 6.2% |
| M | 7600557 | 6.2% |
| u | 7600557 | 6.2% |
| H | 6743373 | 5.5% |
| a | 6743373 | 5.5% |
| l | 6743373 | 5.5% |
| o | 5886189 | 4.8% |
| S | 5872128 | 4.8% |
| Other values (15) | 39588993 |
delay_carrier
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 1650 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.1698273 |
| Minimum | 0 |
|---|---|
| Maximum | 3957 |
| Zeros | 5955483 |
| Zeros (%) | 88.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 102.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 23 |
| Maximum | 3957 |
| Range | 3957 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 36.457406 |
|---|---|
| Coefficient of variation (CV) | 7.0519581 |
| Kurtosis | 861.83964 |
| Mean | 5.1698273 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 22.75351 |
| Sum | 34862074 |
| Variance | 1329.1424 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5955483 | |
| 1 | 25940 | 0.4% |
| 2 | 25697 | 0.4% |
| 3 | 24727 | 0.4% |
| 6 | 24624 | 0.4% |
| 4 | 23830 | 0.4% |
| 15 | 23740 | 0.4% |
| 7 | 22964 | 0.3% |
| 5 | 22733 | 0.3% |
| 8 | 21392 | 0.3% |
| Other values (1640) | 572243 | 8.5% |
| Value | Count | Frequency (%) |
| 0 | 5955483 | |
| 1 | 25940 | 0.4% |
| 2 | 25697 | 0.4% |
| 3 | 24727 | 0.4% |
| 4 | 23830 | 0.4% |
| 5 | 22733 | 0.3% |
| 6 | 24624 | 0.4% |
| 7 | 22964 | 0.3% |
| 8 | 21392 | 0.3% |
| 9 | 20398 | 0.3% |
| Value | Count | Frequency (%) |
| 3957 | 1 | |
| 3786 | 1 | |
| 3502 | 1 | |
| 3424 | 1 | |
| 3337 | 1 | |
| 3246 | 1 | |
| 3221 | 1 | |
| 3045 | 1 | |
| 3024 | 1 | |
| 2998 | 1 |
delay_weather
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 1073 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.74285391 |
| Minimum | 0 |
|---|---|
| Maximum | 1860 |
| Zeros | 6671374 |
| Zeros (%) | 98.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 102.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 1860 |
| Range | 1860 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 14.353961 |
|---|---|
| Coefficient of variation (CV) | 19.322724 |
| Kurtosis | 2949.4932 |
| Mean | 0.74285391 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 45.858185 |
| Sum | 5009341 |
| Variance | 206.0362 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 6671374 | |
| 15 | 1453 | < 0.1% |
| 16 | 1340 | < 0.1% |
| 6 | 1325 | < 0.1% |
| 17 | 1298 | < 0.1% |
| 7 | 1269 | < 0.1% |
| 18 | 1258 | < 0.1% |
| 10 | 1241 | < 0.1% |
| 19 | 1238 | < 0.1% |
| 8 | 1226 | < 0.1% |
| Other values (1063) | 60351 | 0.9% |
| Value | Count | Frequency (%) |
| 0 | 6671374 | |
| 1 | 1114 | < 0.1% |
| 2 | 1188 | < 0.1% |
| 3 | 1168 | < 0.1% |
| 4 | 1140 | < 0.1% |
| 5 | 1104 | < 0.1% |
| 6 | 1325 | < 0.1% |
| 7 | 1269 | < 0.1% |
| 8 | 1226 | < 0.1% |
| 9 | 1183 | < 0.1% |
| Value | Count | Frequency (%) |
| 1860 | 1 | |
| 1747 | 1 | |
| 1738 | 1 | |
| 1728 | 1 | |
| 1653 | 1 | |
| 1643 | 1 | |
| 1609 | 1 | |
| 1561 | 1 | |
| 1529 | 1 | |
| 1522 | 1 |
delay_nas
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 837 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.5669688 |
| Minimum | 0 |
|---|---|
| Maximum | 1708 |
| Zeros | 6082588 |
| Zeros (%) | 90.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 102.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 16 |
| Maximum | 1708 |
| Range | 1708 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 15.004876 |
|---|---|
| Coefficient of variation (CV) | 5.8453674 |
| Kurtosis | 1142.3372 |
| Mean | 2.5669688 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 22.646267 |
| Sum | 17310028 |
| Variance | 225.14629 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 6082588 | |
| 1 | 36929 | 0.5% |
| 2 | 26309 | 0.4% |
| 15 | 25623 | 0.4% |
| 3 | 24666 | 0.4% |
| 16 | 23136 | 0.3% |
| 4 | 23077 | 0.3% |
| 5 | 21730 | 0.3% |
| 17 | 21260 | 0.3% |
| 6 | 20248 | 0.3% |
| Other values (827) | 437807 | 6.5% |
| Value | Count | Frequency (%) |
| 0 | 6082588 | |
| 1 | 36929 | 0.5% |
| 2 | 26309 | 0.4% |
| 3 | 24666 | 0.4% |
| 4 | 23077 | 0.3% |
| 5 | 21730 | 0.3% |
| 6 | 20248 | 0.3% |
| 7 | 19252 | 0.3% |
| 8 | 18386 | 0.3% |
| 9 | 17266 | 0.3% |
| Value | Count | Frequency (%) |
| 1708 | 1 | |
| 1660 | 1 | |
| 1651 | 1 | |
| 1515 | 1 | |
| 1487 | 1 | |
| 1421 | 1 | |
| 1409 | 2 | |
| 1407 | 1 | |
| 1402 | 1 | |
| 1401 | 1 |
delay_security
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 201 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.030648905 |
| Minimum | 0 |
|---|---|
| Maximum | 1460 |
| Zeros | 6735225 |
| Zeros (%) | 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 102.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 1460 |
| Range | 1460 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.6289268 |
|---|---|
| Coefficient of variation (CV) | 53.147961 |
| Kurtosis | 179569.94 |
| Mean | 0.030648905 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 288.46366 |
| Sum | 206677 |
| Variance | 2.6534026 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 6735225 | |
| 15 | 333 | < 0.1% |
| 10 | 293 | < 0.1% |
| 16 | 291 | < 0.1% |
| 17 | 283 | < 0.1% |
| 8 | 270 | < 0.1% |
| 7 | 258 | < 0.1% |
| 12 | 257 | < 0.1% |
| 18 | 257 | < 0.1% |
| 9 | 256 | < 0.1% |
| Other values (191) | 5650 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 6735225 | |
| 1 | 184 | < 0.1% |
| 2 | 182 | < 0.1% |
| 3 | 200 | < 0.1% |
| 4 | 186 | < 0.1% |
| 5 | 243 | < 0.1% |
| 6 | 245 | < 0.1% |
| 7 | 258 | < 0.1% |
| 8 | 270 | < 0.1% |
| 9 | 256 | < 0.1% |
| Value | Count | Frequency (%) |
| 1460 | 1 | |
| 1183 | 1 | |
| 885 | 1 | |
| 808 | 1 | |
| 805 | 1 | |
| 600 | 1 | |
| 581 | 1 | |
| 449 | 1 | |
| 376 | 1 | |
| 373 | 1 |
delay_lastaircraft
Real number (ℝ)
ZEROS 
| Distinct | 1349 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.6811338 |
| Minimum | 0 |
|---|---|
| Maximum | 3581 |
| Zeros | 6032255 |
| Zeros (%) | 89.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 102.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 33 |
| Maximum | 3581 |
| Range | 3581 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 30.446536 |
|---|---|
| Coefficient of variation (CV) | 5.359236 |
| Kurtosis | 537.39645 |
| Mean | 5.6811338 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 16.353549 |
| Sum | 38310004 |
| Variance | 926.99158 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 6032255 | |
| 15 | 17684 | 0.3% |
| 16 | 16691 | 0.2% |
| 17 | 15950 | 0.2% |
| 18 | 15240 | 0.2% |
| 19 | 14258 | 0.2% |
| 20 | 13982 | 0.2% |
| 21 | 13288 | 0.2% |
| 14 | 12685 | 0.2% |
| 22 | 12380 | 0.2% |
| Other values (1339) | 578960 | 8.6% |
| Value | Count | Frequency (%) |
| 0 | 6032255 | |
| 1 | 9092 | 0.1% |
| 2 | 9550 | 0.1% |
| 3 | 9331 | 0.1% |
| 4 | 9511 | 0.1% |
| 5 | 9753 | 0.1% |
| 6 | 10693 | 0.2% |
| 7 | 10528 | 0.2% |
| 8 | 10884 | 0.2% |
| 9 | 10977 | 0.2% |
| Value | Count | Frequency (%) |
| 3581 | 1 | |
| 3228 | 1 | |
| 2586 | 1 | |
| 2557 | 1 | |
| 2530 | 1 | |
| 2366 | 1 | |
| 2329 | 1 | |
| 2325 | 1 | |
| 2277 | 1 | |
| 2258 | 1 |
manufacturer
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.9 MiB |
| BOEING | |
|---|---|
| AIRBUS | |
| EMBRAER | |
| CANADAIR REGIONAL JET | |
| DIAMOND AIRCRAFT | 3 |
Length
| Max length | 21 |
|---|---|
| Median length | 6 |
| Mean length | 7.6746489 |
| Min length | 6 |
Characters and Unicode
| Total characters | 51753020 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CANADAIR REGIONAL JET |
|---|---|
| 2nd row | CANADAIR REGIONAL JET |
| 3rd row | CANADAIR REGIONAL JET |
| 4th row | CANADAIR REGIONAL JET |
| 5th row | CANADAIR REGIONAL JET |
Common Values
| Value | Count | Frequency (%) |
| BOEING | 3122309 | |
| AIRBUS | 1981785 | |
| EMBRAER | 949742 | 14.1% |
| CANADAIR REGIONAL JET | 689534 | 10.2% |
| DIAMOND AIRCRAFT | 3 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| boeing | 3122309 | |
| airbus | 1981785 | |
| embraer | 949742 | 11.7% |
| canadair | 689534 | 8.5% |
| regional | 689534 | 8.5% |
| jet | 689534 | 8.5% |
| diamond | 3 | < 0.1% |
| aircraft | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 6483168 | |
| E | 6400861 | |
| B | 6053836 | |
| A | 5689672 | |
| R | 5260343 | |
| N | 4501380 | |
| O | 3811846 | |
| G | 3811843 | |
| S | 1981785 | 3.8% |
| U | 1981785 | 3.8% |
| Other values (8) | 5776501 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 51753020 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| I | 6483168 | |
| E | 6400861 | |
| B | 6053836 | |
| A | 5689672 | |
| R | 5260343 | |
| N | 4501380 | |
| O | 3811846 | |
| G | 3811843 | |
| S | 1981785 | 3.8% |
| U | 1981785 | 3.8% |
| Other values (8) | 5776501 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 51753020 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| I | 6483168 | |
| E | 6400861 | |
| B | 6053836 | |
| A | 5689672 | |
| R | 5260343 | |
| N | 4501380 | |
| O | 3811846 | |
| G | 3811843 | |
| S | 1981785 | 3.8% |
| U | 1981785 | 3.8% |
| Other values (8) | 5776501 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 51753020 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| I | 6483168 | |
| E | 6400861 | |
| B | 6053836 | |
| A | 5689672 | |
| R | 5260343 | |
| N | 4501380 | |
| O | 3811846 | |
| G | 3811843 | |
| S | 1981785 | 3.8% |
| U | 1981785 | 3.8% |
| Other values (8) | 5776501 |
model
Categorical
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.9 MiB |
| 737 NG | |
|---|---|
| 170/175 | |
| A320 | |
| A321 | |
| CRJ | |
| Other values (16) |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 5.074076 |
| Min length | 3 |
Characters and Unicode
| Total characters | 34216387 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CRJ |
|---|---|
| 2nd row | CRJ |
| 3rd row | CRJ |
| 4th row | CRJ |
| 5th row | CRJ |
Common Values
| Value | Count | Frequency (%) |
| 737 NG | 2703483 | |
| 170/175 | 863328 | 12.8% |
| A320 | 769278 | 11.4% |
| A321 | 704641 | 10.4% |
| CRJ | 689534 | 10.2% |
| A319 | 390607 | 5.8% |
| 717 | 184919 | 2.7% |
| 757 | 156147 | 2.3% |
| A220 | 98132 | 1.5% |
| 190/195 | 75262 | 1.1% |
| Other values (11) | 108042 | 1.6% |
Length
| Value | Count | Frequency (%) |
| 737 | 2720609 | |
| ng | 2703483 | |
| 170/175 | 863328 | 9.1% |
| a320 | 769309 | 8.1% |
| a321 | 704641 | 7.4% |
| crj | 689534 | 7.3% |
| a319 | 390607 | 4.1% |
| 717 | 184919 | 2.0% |
| 757 | 156147 | 1.7% |
| a220 | 98132 | 1.0% |
| Other values (10) | 179723 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 7999129 | |
| 3 | 4630530 | |
| 1 | 3179651 | 9.3% |
| 2717090 | 7.9% | |
| N | 2717059 | 7.9% |
| G | 2717059 | 7.9% |
| A | 1995364 | 5.8% |
| 0 | 1827490 | 5.3% |
| 2 | 1670214 | 4.9% |
| 5 | 1118661 | 3.3% |
| Other values (11) | 3644140 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 34216387 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 7 | 7999129 | |
| 3 | 4630530 | |
| 1 | 3179651 | 9.3% |
| 2717090 | 7.9% | |
| N | 2717059 | 7.9% |
| G | 2717059 | 7.9% |
| A | 1995364 | 5.8% |
| 0 | 1827490 | 5.3% |
| 2 | 1670214 | 4.9% |
| 5 | 1118661 | 3.3% |
| Other values (11) | 3644140 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 34216387 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 7 | 7999129 | |
| 3 | 4630530 | |
| 1 | 3179651 | 9.3% |
| 2717090 | 7.9% | |
| N | 2717059 | 7.9% |
| G | 2717059 | 7.9% |
| A | 1995364 | 5.8% |
| 0 | 1827490 | 5.3% |
| 2 | 1670214 | 4.9% |
| 5 | 1118661 | 3.3% |
| Other values (11) | 3644140 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 34216387 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 7 | 7999129 | |
| 3 | 4630530 | |
| 1 | 3179651 | 9.3% |
| 2717090 | 7.9% | |
| N | 2717059 | 7.9% |
| G | 2717059 | 7.9% |
| A | 1995364 | 5.8% |
| 0 | 1827490 | 5.3% |
| 2 | 1670214 | 4.9% |
| 5 | 1118661 | 3.3% |
| Other values (11) | 3644140 |
aicraft_age
Real number (ℝ)
| Distinct | 39 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.480635 |
| Minimum | 1 |
|---|---|
| Maximum | 57 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 102.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 7 |
| median | 12 |
| Q3 | 20 |
| 95-th percentile | 25 |
| Maximum | 57 |
| Range | 56 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 7.8914987 |
|---|---|
| Coefficient of variation (CV) | 0.58539517 |
| Kurtosis | -0.73164272 |
| Mean | 13.480635 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0.29683303 |
| Sum | 90904951 |
| Variance | 62.275752 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 403055 | 6.0% |
| 5 | 399397 | 5.9% |
| 23 | 379335 | 5.6% |
| 10 | 373103 | 5.5% |
| 7 | 355229 | 5.3% |
| 9 | 344486 | 5.1% |
| 6 | 330744 | 4.9% |
| 2 | 327330 | 4.9% |
| 24 | 313636 | 4.7% |
| 18 | 257437 | 3.8% |
| Other values (29) | 3259621 |
| Value | Count | Frequency (%) |
| 1 | 186073 | |
| 2 | 327330 | |
| 3 | 169787 | |
| 4 | 141338 | 2.1% |
| 5 | 399397 | |
| 6 | 330744 | |
| 7 | 355229 | |
| 8 | 403055 | |
| 9 | 344486 | |
| 10 | 373103 |
| Value | Count | Frequency (%) |
| 57 | 826 | < 0.1% |
| 56 | 1062 | < 0.1% |
| 48 | 2360 | < 0.1% |
| 39 | 645 | < 0.1% |
| 38 | 398 | < 0.1% |
| 34 | 9283 | 0.1% |
| 33 | 18807 | |
| 32 | 34391 | |
| 31 | 18053 | |
| 30 | 27339 |
| flightdate | day_of_week | airline | tail_number | dep_airport | dep_cityname | deptime_label | dep_delay | dep_delay_tag | dep_delay_type | arr_airport | arr_cityname | arr_delay | arr_delay_type | flight_duration | distance_type | delay_carrier | delay_weather | delay_nas | delay_security | delay_lastaircraft | manufacturer | model | aicraft_age | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2023-01-02 | Monday | Endeavor Air | N605LR | BDL | Hartford, CT | Morning | -3 | 0 | Low <5min | LGA | New York, NY | -12 | Low <5min | 56 | Short Haul >1500Mi | 0 | 0 | 0 | 0 | 0 | CANADAIR REGIONAL JET | CRJ | 16 |
| 1 | 2023-01-03 | Tuesday | Endeavor Air | N605LR | BDL | Hartford, CT | Morning | -5 | 0 | Low <5min | LGA | New York, NY | -8 | Low <5min | 62 | Short Haul >1500Mi | 0 | 0 | 0 | 0 | 0 | CANADAIR REGIONAL JET | CRJ | 16 |
| 2 | 2023-01-04 | Wednesday | Endeavor Air | N331PQ | BDL | Hartford, CT | Morning | -5 | 0 | Low <5min | LGA | New York, NY | -21 | Low <5min | 49 | Short Haul >1500Mi | 0 | 0 | 0 | 0 | 0 | CANADAIR REGIONAL JET | CRJ | 10 |
| 3 | 2023-01-05 | Thursday | Endeavor Air | N906XJ | BDL | Hartford, CT | Morning | -6 | 0 | Low <5min | LGA | New York, NY | -17 | Low <5min | 54 | Short Haul >1500Mi | 0 | 0 | 0 | 0 | 0 | CANADAIR REGIONAL JET | CRJ | 17 |
| 4 | 2023-01-06 | Friday | Endeavor Air | N337PQ | BDL | Hartford, CT | Morning | -1 | 0 | Low <5min | LGA | New York, NY | -16 | Low <5min | 50 | Short Haul >1500Mi | 0 | 0 | 0 | 0 | 0 | CANADAIR REGIONAL JET | CRJ | 10 |
| 5 | 2023-01-07 | Saturday | Endeavor Air | N336PQ | BDL | Hartford, CT | Morning | -10 | 0 | Low <5min | LGA | New York, NY | -13 | Low <5min | 62 | Short Haul >1500Mi | 0 | 0 | 0 | 0 | 0 | CANADAIR REGIONAL JET | CRJ | 10 |
| 6 | 2023-01-14 | Saturday | Endeavor Air | N311PQ | LGA | New York, NY | Afternoon | -8 | 0 | Low <5min | CVG | Cincinnati, OH | -31 | Low <5min | 117 | Short Haul >1500Mi | 0 | 0 | 0 | 0 | 0 | CANADAIR REGIONAL JET | CRJ | 10 |
| 7 | 2023-01-21 | Saturday | Endeavor Air | N917XJ | LGA | New York, NY | Afternoon | -10 | 0 | Low <5min | CVG | Cincinnati, OH | -25 | Low <5min | 125 | Short Haul >1500Mi | 0 | 0 | 0 | 0 | 0 | CANADAIR REGIONAL JET | CRJ | 16 |
| 8 | 2023-01-28 | Saturday | Endeavor Air | N336PQ | LGA | New York, NY | Afternoon | -5 | 0 | Low <5min | CVG | Cincinnati, OH | -15 | Low <5min | 130 | Short Haul >1500Mi | 0 | 0 | 0 | 0 | 0 | CANADAIR REGIONAL JET | CRJ | 10 |
| 9 | 2023-01-09 | Monday | Endeavor Air | N491PX | LGA | New York, NY | Evening | -7 | 0 | Low <5min | BGM | Binghamton, NY | -3 | Low <5min | 63 | Short Haul >1500Mi | 0 | 0 | 0 | 0 | 0 | CANADAIR REGIONAL JET | CRJ | 4 |
| flightdate | day_of_week | airline | tail_number | dep_airport | dep_cityname | deptime_label | dep_delay | dep_delay_tag | dep_delay_type | arr_airport | arr_cityname | arr_delay | arr_delay_type | flight_duration | distance_type | delay_carrier | delay_weather | delay_nas | delay_security | delay_lastaircraft | manufacturer | model | aicraft_age | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6743394 | 2023-12-31 | Sunday | JetBlue Airways | N937JB | BOS | Boston, MA | Afternoon | 60 | 1 | Medium >15min | BUF | Buffalo, NY | 43 | Medium >15min | 80 | Short Haul >1500Mi | 43 | 0 | 0 | 0 | 0 | AIRBUS | A321 | 10 |
| 6743395 | 2023-12-31 | Sunday | JetBlue Airways | N945JT | SFO | San Francisco, CA | Morning | -8 | 0 | Low <5min | JFK | New York, NY | -14 | Low <5min | 326 | Medium Haul <3000Mi | 0 | 0 | 0 | 0 | 0 | AIRBUS | A321 | 10 |
| 6743396 | 2023-12-31 | Sunday | JetBlue Airways | N558JB | ORH | Worcester, MA | Afternoon | -4 | 0 | Low <5min | FLL | Fort Lauderdale, FL | -35 | Low <5min | 169 | Short Haul >1500Mi | 0 | 0 | 0 | 0 | 0 | AIRBUS | A320 | 24 |
| 6743397 | 2023-12-31 | Sunday | JetBlue Airways | N284JB | BOS | Boston, MA | Afternoon | -5 | 0 | Low <5min | BWI | Baltimore, MD | -20 | Low <5min | 85 | Short Haul >1500Mi | 0 | 0 | 0 | 0 | 0 | EMBRAER | 190/195 | 16 |
| 6743398 | 2023-12-31 | Sunday | JetBlue Airways | N661JB | JFK | New York, NY | Morning | 20 | 1 | Medium >15min | RSW | Fort Myers, FL | -1 | Low <5min | 175 | Short Haul >1500Mi | 0 | 0 | 0 | 0 | 0 | AIRBUS | A320 | 17 |
| 6743399 | 2023-12-31 | Sunday | JetBlue Airways | N903JB | SJU | San Juan, PR | Morning | 4 | 1 | Low <5min | JFK | New York, NY | -33 | Low <5min | 219 | Medium Haul <3000Mi | 0 | 0 | 0 | 0 | 0 | AIRBUS | A321 | 11 |
| 6743400 | 2023-12-31 | Sunday | JetBlue Airways | N535JB | MCO | Orlando, FL | Evening | 113 | 1 | Hight >60min | SJU | San Juan, PR | 100 | Hight >60min | 162 | Short Haul >1500Mi | 4 | 0 | 0 | 0 | 96 | AIRBUS | A320 | 22 |
| 6743401 | 2023-12-31 | Sunday | JetBlue Airways | N354JB | PHL | Philadelphia, PA | Afternoon | -11 | 0 | Low <5min | BOS | Boston, MA | -12 | Low <5min | 73 | Short Haul >1500Mi | 0 | 0 | 0 | 0 | 0 | EMBRAER | 190/195 | 11 |
| 6743402 | 2023-12-31 | Sunday | JetBlue Airways | N768JB | PBI | West Palm Beach/Palm Beach, FL | Afternoon | -7 | 0 | Low <5min | BDL | Hartford, CT | -30 | Low <5min | 158 | Short Haul >1500Mi | 0 | 0 | 0 | 0 | 0 | AIRBUS | A320 | 15 |
| 6743403 | 2023-12-31 | Sunday | JetBlue Airways | N547JB | BDL | Hartford, CT | Morning | -8 | 0 | Low <5min | PBI | West Palm Beach/Palm Beach, FL | -24 | Low <5min | 173 | Short Haul >1500Mi | 0 | 0 | 0 | 0 | 0 | AIRBUS | A320 | 22 |